PVT: Point‐voxel transformer for point cloud learning

نویسندگان

چکیده

The recently developed pure transformer architectures have attained promising accuracy on point cloud learning benchmarks compared to convolutional neural networks. However, existing Transformers are computationally expensive because they waste a significant amount of time structuring irregular data. To solve this shortcoming, we present the Sparse Window Attention module gather coarse-grained local features from nonempty voxels. not only bypasses data and invalid empty voxel computation, but also obtains linear computational complexity with respect resolution. Meanwhile, leverage two different self-attention variants fine-grained about global shape according scale clouds. Finally, construct our architecture called point-voxel (PVT), which integrates these modules into joint framework for learning. Compared previous transformer-based attention-based models, method attains top 94.1% classification benchmark 10 × $10\times $ inference speedup average. Extensive experiments validate effectiveness PVT semantic segmentation benchmarks. Our code pretrained model avaliable at https://github.com/HaochengWan/PVT.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Interactive Learning for Point-Cloud Motion Segmentation

Segmenting a moving foreground (fg) from its background (bg) is a fundamental step in many Machine Vision and Computer Graphics applications. Nevertheless, hardly any attempts have been made to tackle this problem in dynamic 3D scanned scenes. Scanned dynamic scenes are typically challenging due to noise and large missing parts. Here, we present a novel approach for motion segmentation in dynam...

متن کامل

Learning 3D Point Cloud Histograms

In this paper we show how using histograms based on the angular relationships between a subset of point normals in a 3D point Cloud can be used in a machine learning algorithm in order to recognize different classes of objects given by their 3D point clouds. This approach extends the work done by Gary Bradski at Willow Garage on point clouds recognition by applying a machine learning approach t...

متن کامل

Multi-View Point Cloud Kernels for Semi-Supervised Learning

In semi-supervised learning (SSL), we learn a predictive model from a collection of labeled data and a typically much larger collection of unlabeled data. These lecture notes present a framework called multi-view point cloud regularization (MVPCR) [5], which unifies and generalizes several semi-supervised kernel methods that are based on data-dependent regularization in reproducing kernel Hilbe...

متن کامل

Learning Efficient Point Cloud Generation for Dense 3D Object Reconstruction

Conventional methods of 3D object generative modeling learn volumetric predictions using deep networks with 3D convolutional operations, which are direct analogies to classical 2D ones. However, these methods are computationally wasteful in attempt to predict 3D shapes, where information is rich only on the surfaces. In this paper, we propose a novel 3D generative modeling framework to efficien...

متن کامل

Cloud Computing; A New Approach to Learning and Learning

Introduction: The cloud computing and services, as a technological solution for developing educational services, can accelerate the provision and expansion of these highly useful services. This study intended to provide an overall picture of practical areas of learning services based on cloud computing teaching and learning equipment. Methods: This was a theoretical hybrid research study in whi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Intelligent Systems

سال: 2022

ISSN: ['1098-111X', '0884-8173']

DOI: https://doi.org/10.1002/int.23073